Multiscale principal component analysis
Abstract
Principal component analysis (PCA) is an important tool for exploring data. The conventional approach to PCA yields a solution that favours structures with large variances. This makes it sensitive to outliers and can obscure interesting underlying structures. One of the equivalent definitions of PCA is that it seeks the subspaces that maximize the sum of squared pairwise distances between data projections. This definition allows more flexibility in the analysis of principal components, which is useful for enhancing PCA. In this paper we introduce scales into PCA by maximizing the sum of pairwise distances between projections only over pairs of data points whose distances lie within a chosen interval [l, u]. The resulting principal component decompositions in Multiscale PCA depend on the point (l, u) in the plane, and for each point we define projectors onto the principal components. Cluster analysis of these projectors reveals the structures in the data at various scales. Each structure is described by the eigenvectors at the medoid point of the cluster that represents the structure. We also use the distortion of projections as a criterion for choosing an appropriate scale, especially for data with outliers. The method was tested on both artificial data distributions and real data. For data with multiscale structures, the method was able to reveal the different structures in the data and to reduce the effect of outliers on the principal component analysis.
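The pairwise-distance formulation in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the objective uses squared Euclidean distances, so that the maximizing subspace is spanned by the leading eigenvectors of the scatter matrix of difference vectors x_i - x_j restricted to pairs with l <= ||x_i - x_j|| <= u. The function name `multiscale_pca` and its parameters are hypothetical.

```python
import numpy as np

def multiscale_pca(X, l, u, n_components=1):
    """Hypothetical helper: PCA restricted to pairs whose distance lies in [l, u].

    Assumption (not confirmed by the abstract): squared Euclidean distances
    between projections are maximized, which reduces to an eigenproblem on
    the scatter matrix of the selected pairwise differences.
    """
    X = np.asarray(X, dtype=float)
    n, _ = X.shape
    i, j = np.triu_indices(n, k=1)            # all unordered pairs i < j
    diffs = X[i] - X[j]                       # difference vectors, O(n^2 d) memory
    dists = np.linalg.norm(diffs, axis=1)
    mask = (dists >= l) & (dists <= u)        # keep only pairs at the chosen scale
    selected = diffs[mask]
    scatter = selected.T @ selected           # d x d symmetric scatter matrix
    eigvals, eigvecs = np.linalg.eigh(scatter)  # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    return eigvals[order], eigvecs[:, order], int(mask.sum())

# Small demonstration on synthetic data (values chosen only for illustration).
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(50, 3)) * np.array([3.0, 1.0, 0.2])

# With [l, u] = [0, inf) every pair is used, which recovers ordinary PCA.
vals_all, vecs_all, n_pairs_all = multiscale_pca(X_demo, 0.0, np.inf, n_components=2)

# A narrower interval uses only pairs at short distances.
vals_small, vecs_small, n_pairs_small = multiscale_pca(X_demo, 0.0, 1.0, n_components=2)

# Projector onto the selected principal subspace for this (l, u).
projector_small = vecs_small @ vecs_small.T
```

With the interval [0, inf) every pair is selected and the scatter matrix equals n times the centred scatter matrix, so the sketch reduces to conventional PCA. Forming all pairwise differences explicitly is quadratic in the number of points, so this form is only practical for small data sets.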
Similar resources
Novel Feature Extraction for Face Recognition using Multiscale Principal Component Analysis
A method of face recognition based on multiscale principal component analysis (MSPCA) is presented in this paper. Initially, the face area is extracted from the given face image using the AdaBoost face detection algorithm. From the face area, regions of interest such as the eyes, nose and mouth are extracted by dividing it along the horizontal and vertical directions. Then MSPCA is employed on these region...
Wavelet Based Multi-Scale Principal Component Analysis for Speech Enhancement
The goal of speech enhancement varies according to the specific application: to reduce listener fatigue, to boost overall speech quality, to increase intelligibility, or to improve the performance of the voice communication device. This paper presents multiscale principal component analysis (MSPCA) for denoising a single-channel speech signal. Principal Component Analysis (PCA) is a ...
A Multivariable Statistical Process Monitoring Method Based on Multiscale Analysis and Principal Curves
This study aims to develop an algorithm that integrates multi-resolution analysis (MRA) and principal curves (PC) for monitoring multivariate processes. This may pave the way for handling nonlinear data by means of principal curves in the process monitoring area. We succeed in utilizing the PC technique for monitoring without the assistance of neural networks, a traditional tool to deal with nonlinear m...
Comparison of statistical process monitoring methods: application to the Eastman challenge problem
Multivariate statistical process control (MSPC) has been successfully applied to chemical processes. In order to improve the performance of fault detection, two kinds of advanced methods, known as moving principal component analysis (MPCA) and DISSIM, have been proposed. In MPCA and DISSIM, an abnormal operation can be detected by monitoring the directions of principal components (PCs) and the ...
Principal component analysis or factor analysis different wording or methodological fault?
This article has no abstract.